Attribute-based Mandarin speech recognition using conditional random fields
نویسندگان
چکیده
Integrating phonetic knowledge into a speech recognizer is a possible way to further improve the performance of conventional HMM-based speech recognition methods. This paper presents a cascaded architecture which consists of attribute detection and conditional random field to make use of phonetic knowledge within the phone decoding process. The attribute detection can be implemented by using any effective feature extraction approaches. In this study, an HMM-based method is applied for attribute tagging of Mandarin speech. Then a conditional random field method which applies attribute labels as the input vectors is used to perform the speech recognition. The preliminary experiment result shows that the proposed method is very promising and worthy for further investigation.
منابع مشابه
On-line Mandarin Phonetic Symbol Recognition for Video-based Fingertip Input System, " Revised in Journal of Visual Communication and Image Representation
[1] Wei-Tyng Hong, “Hidden Conditional Random Fields for Resource-constrained Speech Recognition”, Advanced Science Letters. (accepted, 2011) (EI, SCI) [2] Wei-Tyng Hong, “An Investigation on Robust Confidence Measure and Model Compensations for Smartphone-based Speech Recognition”, International Journal of Advanced Information Technologies. (accepted, 2011) [3] Wei-Tyng Hong, “Text-independent...
متن کاملMinimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition
Hidden conditional random fields (HCRFs) are derived from the theory of conditional random fields with hidden-state probabilistic framework. It directly models the conditional probability of a label sequence given observations. Compared to hidden Markov models, HCRFs provide a number of benefits in the acoustic modeling of speech signals. Prior works for training on HCRFs were accomplished with...
متن کاملMonotone string-to-string translation for NLU and ASR tasks
Monotone string-to-string translation problems have to be tackled as part of almost all stateof-the-art natural language understanding and large vocabulary continuous speech recognition systems. In this work, two such tasks will be investigated in detail and improved using conditional random fields, namely concept tagging and grapheme-to-phoneme conversion. Concept tagging is usually one of the...
متن کاملImproving Large Vocabulary Accented Mandarin Speech Recognition with Attribute-Based I-Vectors
It has been well-recognized that the accent has a great impact on the ASR of Chinese Mandarin, therefore, how to improve the performance on the accented speech has become a critical issue in this field. The attribute feature has been proven effective on modelling accented speech, resulting in a significantly improved performance in accent recognition. In this paper, we propose an attribute-base...
متن کاملDetection-based ASR in the automatic speech attribute transcription project
We present methods of detector design in the Automatic Speech Attribute Transcription project. This paper details the results of a student-led, cross-site collaboration between Georgia Institute of Technology, The Ohio State University and Rutgers University. The work reported in this paper describes and evaluates the detection-based ASR paradigm and discusses phonetic attribute classes, method...
متن کامل